    Robust Large Margin Deep Neural Networks

    This paper studies the generalization error of deep neural networks via their classification margin. Our approach is based on the Jacobian matrix of a deep neural network and can be applied to networks with arbitrary nonlinearities and pooling layers, and to networks with different architectures such as feed-forward networks and residual networks. Our analysis leads to the conclusion that a bounded spectral norm of the network's Jacobian matrix in the neighbourhood of the training samples is crucial for a deep neural network of arbitrary depth and width to generalize well. This is a significant improvement over the current bounds in the literature, which imply that the generalization error grows with either the width or the depth of the network. Moreover, it shows that the recently proposed batch normalization and weight normalization reparametrizations enjoy good generalization properties, and it leads to a novel network regularizer based on the network's Jacobian matrix. The analysis is supported with experimental results on the MNIST, CIFAR-10, LaRED, and ImageNet datasets.
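    The key quantity here is the spectral norm of the network's input-output Jacobian around the training samples. As a rough illustration only (not the authors' implementation), the following NumPy sketch computes such a Jacobian-norm penalty for a hypothetical two-layer ReLU network; the names, shapes, and weighting constant are illustrative assumptions.

```python
import numpy as np

def relu_jacobian_spectral_norm(W1, W2, x):
    """Spectral norm of the input-output Jacobian of f(x) = W2 @ relu(W1 @ x),
    evaluated at a single sample x."""
    pre = W1 @ x                        # hidden-layer pre-activations
    mask = (pre > 0).astype(x.dtype)    # ReLU derivative (0/1 gating)
    J = W2 @ (mask[:, None] * W1)       # J = W2 diag(relu'(W1 x)) W1
    return np.linalg.norm(J, 2)         # largest singular value

# illustrative shapes: 10-dim input, 32 hidden units, 3 classes
rng = np.random.default_rng(0)
W1 = rng.normal(scale=0.1, size=(32, 10))
W2 = rng.normal(scale=0.1, size=(3, 32))
x = rng.normal(size=10)

lam = 1e-2  # illustrative regularization weight
penalty = lam * relu_jacobian_spectral_norm(W1, W2, x) ** 2
print(f"Jacobian penalty at this sample: {penalty:.4f}")
```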

    Greedy-Like Algorithms for the Cosparse Analysis Model

    The cosparse analysis model has been introduced recently as an interesting alternative to the standard sparse synthesis approach. A prominent question brought up by this new construction is the analysis pursuit problem: the need to find a signal belonging to this model, given a set of corrupted measurements of it. Several pursuit methods have already been proposed based on $\ell_1$ relaxation and on a greedy approach. In this work we pursue this question further and propose a new family of pursuit algorithms for the cosparse analysis model, mimicking the greedy-like methods: compressive sampling matching pursuit (CoSaMP), subspace pursuit (SP), iterative hard thresholding (IHT), and hard thresholding pursuit (HTP). Assuming the availability of a near-optimal projection scheme that finds the nearest cosparse subspace to any vector, we provide performance guarantees for these algorithms. Our theoretical study relies on a restricted isometry property adapted to the context of the cosparse analysis model. We explore the performance of these algorithms empirically by adopting a plain thresholding projection, demonstrating their good performance.
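    To make the greedy-like analysis pursuit concrete, here is a hedged NumPy sketch of an IHT-style iteration with a plain thresholding projection; the operator, sizes, step size, and toy signal are illustrative, and the sketch follows the general structure described above rather than the exact algorithms of the paper.

```python
import numpy as np

def cosupport_projection(x, Omega, cosparsity):
    """Plain thresholding projection: take the `cosparsity` rows of Omega with
    the smallest |Omega @ x| as the cosupport, then project x onto the null
    space of those rows."""
    Lam = np.argsort(np.abs(Omega @ x))[:cosparsity]
    O_L = Omega[Lam]
    return x - np.linalg.pinv(O_L) @ (O_L @ x)

def analysis_iht(y, M, Omega, cosparsity, n_iter=500):
    """Analysis-IHT-style iteration: gradient step on the data-fit term
    followed by the cosupport projection above."""
    step = 1.0 / np.linalg.norm(M, 2) ** 2        # conservative step size
    x = np.zeros(M.shape[1])
    for _ in range(n_iter):
        x = x + step * M.T @ (y - M @ x)          # forward (gradient) step
        x = cosupport_projection(x, Omega, cosparsity)
    return x

# toy example: piecewise-constant signal with a 1-D finite-difference operator
rng = np.random.default_rng(0)
n, m = 64, 40
Omega = np.eye(n, k=1)[:-1] - np.eye(n)[:-1]      # (n-1) x n difference operator
x_true = np.repeat(rng.normal(size=4), n // 4)    # 3 jumps => highly cosparse
M = rng.normal(size=(m, n)) / np.sqrt(m)
x_hat = analysis_iht(M @ x_true, M, Omega, cosparsity=Omega.shape[0] - 3)
```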

    Low Complexity Regularization of Linear Inverse Problems

    Inverse problems and regularization theory are a central theme in contemporary signal processing, where the goal is to reconstruct an unknown signal from partial, indirect, and possibly noisy measurements of it. A now standard method for recovering the unknown signal is to solve a convex optimization problem that enforces some prior knowledge about its structure. This has proved efficient in many problems routinely encountered in imaging sciences, statistics, and machine learning. This chapter delivers a review of recent advances in the field where the regularization prior promotes solutions conforming to some notion of simplicity/low complexity. These priors encompass, as popular examples, sparsity and group sparsity (to capture the compressibility of natural signals and images), total variation and analysis sparsity (to promote piecewise regularity), and low rank (as a natural extension of sparsity to matrix-valued data). Our aim is to provide a unified treatment of all these regularizations under a single umbrella, namely the theory of partial smoothness. This framework is very general and accommodates all the low-complexity regularizers just mentioned, as well as many others. Partial smoothness turns out to be the canonical way to encode low-dimensional models that can be linear spaces or more general smooth manifolds. This review is intended to serve as a one-stop shop toward the understanding of the theoretical properties of the so-regularized solutions. It covers a large spectrum including: (i) recovery guarantees and stability to noise, both in terms of $\ell^2$-stability and model (manifold) identification; (ii) sensitivity analysis to perturbations of the parameters involved (in particular the observations), with applications to unbiased risk estimation; (iii) convergence properties of the forward-backward proximal splitting scheme, which is particularly well suited to solving the corresponding large-scale regularized optimization problem.
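    The forward-backward proximal splitting scheme mentioned in (iii) is easiest to see on the $\ell_1$ (sparsity) prior, whose proximal map is soft-thresholding: this gives the classical ISTA iteration. A minimal sketch, with an illustrative random operator and regularization weight:

```python
import numpy as np

def soft_threshold(v, tau):
    """Proximal operator of tau * ||.||_1 (soft-thresholding)."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def forward_backward_l1(y, A, lam, n_iter=300):
    """Forward-backward splitting for min_x 0.5 ||A x - y||^2 + lam ||x||_1:
    a gradient (forward) step on the smooth data-fit term, then a proximal
    (backward) step on the l1 prior."""
    gamma = 1.0 / np.linalg.norm(A, 2) ** 2       # step size <= 1 / Lipschitz constant
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        x = soft_threshold(x - gamma * A.T @ (A @ x - y), gamma * lam)
    return x

# toy usage with an illustrative random sensing operator
rng = np.random.default_rng(0)
A = rng.normal(size=(50, 200)) / np.sqrt(50)
x0 = np.zeros(200)
x0[[5, 40, 120]] = [1.0, -2.0, 1.5]               # sparse ground truth
x_rec = forward_backward_l1(A @ x0, A, lam=0.05)
```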

    Deep Randomized Neural Networks

    Randomized Neural Networks explore the behavior of neural systems where the majority of connections are fixed, either in a stochastic or a deterministic fashion. Typical examples of such systems are multi-layered neural network architectures where the connections to the hidden layer(s) are left untrained after initialization. Limiting the training algorithms to operate on a reduced set of weights inherently characterizes the class of Randomized Neural Networks with a number of intriguing features. Among them, the extreme efficiency of the resulting learning processes is undoubtedly a striking advantage with respect to fully trained architectures. Moreover, despite the involved simplifications, randomized neural systems possess remarkable properties both in practice, achieving state-of-the-art results in multiple domains, and in theory, making it possible to analyze intrinsic properties of neural architectures (e.g., before training of the hidden layers' connections). In recent years, the study of Randomized Neural Networks has been extended towards deep architectures, opening new research directions in the design of effective yet extremely efficient deep learning models in vectorial as well as in more complex data domains. This chapter surveys the major aspects regarding the design and analysis of Randomized Neural Networks, and some of the key results with respect to their approximation capabilities. In particular, we first introduce the fundamentals of randomized neural models in the context of feed-forward networks (i.e., Random Vector Functional Link and equivalent models) and convolutional filters, before moving to the case of recurrent systems (i.e., Reservoir Computing networks). For both, we focus specifically on recent results in the domain of deep randomized systems and, for recurrent models, their application to structured domains.
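    A minimal sketch of the feed-forward case, in the spirit of a Random Vector Functional Link model: the input-to-hidden weights are drawn at random and kept fixed, and only the readout is trained, here by a closed-form ridge regression. The sizes, activation, and regularization constant are illustrative assumptions.

```python
import numpy as np

def rvfl_fit(X, Y, n_hidden=200, reg=1e-3, seed=0):
    """Randomized feed-forward network: random, fixed input-to-hidden weights;
    only the linear readout is trained (ridge regression in closed form)."""
    rng = np.random.default_rng(seed)
    W_in = rng.normal(size=(X.shape[1], n_hidden))
    b = rng.uniform(-1.0, 1.0, size=n_hidden)
    H = np.tanh(X @ W_in + b)                     # fixed random features
    H = np.hstack([H, X])                         # direct input-output links (RVFL)
    W_out = np.linalg.solve(H.T @ H + reg * np.eye(H.shape[1]), H.T @ Y)
    return W_in, b, W_out

def rvfl_predict(X, W_in, b, W_out):
    H = np.hstack([np.tanh(X @ W_in + b), X])
    return H @ W_out

# toy regression usage
rng = np.random.default_rng(1)
X = rng.uniform(-1.0, 1.0, size=(300, 2))
Y = np.sin(3.0 * X[:, 0]) + X[:, 1] ** 2          # illustrative target function
W_in, b, W_out = rvfl_fit(X, Y)
Y_hat = rvfl_predict(X, W_in, b, W_out)
```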

    Iterative signal recovery from incomplete samples

    Generalised non‐locally centralised image de‐noising using sparse dictionary

    Generalization Error of Invariant Classifiers

    This paper studies the generalization error of invariant classifiers. In particular, we consider the common scenario where the classification task is invariant to certain transformations of the input and the classifier is constructed (or learned) to be invariant to these transformations. Our approach relies on factoring the input space into a product of a base space and a set of transformations. We show that whereas the generalization error of a non-invariant classifier is proportional to the complexity of the input space, the generalization error of an invariant classifier is proportional to the complexity of the base space. We also derive a set of sufficient conditions on the geometry of the base space and the set of transformations that ensure that the complexity of the base space is much smaller than the complexity of the input space. Our analysis applies to general classifiers such as convolutional neural networks. We demonstrate the implications of the developed theory for such classifiers with experiments on the MNIST and CIFAR-10 datasets. (Accepted to AISTATS.)
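    As a toy instance of the base-space idea (not taken from the paper), consider a classifier that must be invariant to cyclic shifts of a 1-D signal: mapping every input to a canonical representative of its shift orbit and classifying that representative makes any downstream classifier operate on the base space rather than on the full input space. A hedged NumPy sketch:

```python
import numpy as np

def canonicalize_shift(x):
    """Map a 1-D signal to a canonical representative of its cyclic-shift
    orbit by rolling it so that the largest entry comes first."""
    return np.roll(x, -int(np.argmax(x)))

def invariant_predict(classifier, x):
    """Wrap an arbitrary classifier so that it only sees the canonical
    representative, i.e. it effectively operates on the base space."""
    return classifier(canonicalize_shift(x))

# sanity check: two shifted copies of the same signal get the same prediction
rng = np.random.default_rng(0)
x = rng.normal(size=16)
toy_clf = lambda v: float(v @ np.arange(16))      # placeholder "classifier"
assert np.isclose(invariant_predict(toy_clf, x),
                  invariant_predict(toy_clf, np.roll(x, 5)))
```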

    Z2P: Instant Visualization of Point Clouds

    We present a technique for visualizing point clouds using a neural network. Our technique allows for an instant preview of any point cloud, and bypasses the notoriously difficult surface reconstruction problem or the need to estimate oriented normals for splat-based rendering. We cast the preview problem as a conditional image-to-image translation task, and design a neural network that translates a point depth-map directly into an image, where the point cloud is visualized as though a surface had been reconstructed from it. Furthermore, the resulting appearance of the visualized point cloud can optionally be conditioned on simple control variables (e.g., color and light). We demonstrate that our technique instantly produces plausible images and can effectively handle, on the fly, noise, non-uniform sampling, and thin surface sheets.
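    The first step of such a pipeline, turning the raw point cloud into the depth map that the translation network consumes, can be sketched with an orthographic projection and a simple z-buffer. The resolution, projection axis, and background value below are illustrative assumptions, and the image-to-image network itself is omitted.

```python
import numpy as np

def points_to_depth_map(points, res=128):
    """Rasterize an (N, 3) point cloud into a depth map by orthographic
    projection onto the xy-plane, keeping the smallest z per pixel (the
    nearest point for a camera looking along +z)."""
    xy, z = points[:, :2], points[:, 2]
    lo, hi = xy.min(axis=0), xy.max(axis=0)
    pix = ((xy - lo) / (hi - lo + 1e-9) * (res - 1)).astype(int)
    depth = np.full((res, res), np.inf)
    for (u, v), d in zip(pix, z):
        depth[v, u] = min(depth[v, u], d)         # z-buffer: keep nearest point
    depth[np.isinf(depth)] = 0.0                  # empty pixels -> background
    return depth

# toy usage: points sampled on a unit sphere
pts = np.random.default_rng(2).normal(size=(5000, 3))
pts /= np.linalg.norm(pts, axis=1, keepdims=True)
dmap = points_to_depth_map(pts)                   # (128, 128) array for the network
```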